PlantTFDB
Plant Transcription Factor Database
v4.0
Previous version: v3.0
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Sopen08g026800.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; asterids; lamiids; Solanales; Solanaceae; Solanoideae; Solaneae; Solanum; Lycopersicon
Family HD-ZIP
Protein Properties Length: 279aa    MW: 31556.4 Da    PI: 7.7374
Description HD-ZIP family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Sopen08g026800.1genomespennView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1Homeobox58.79.3e-19129183256
                       T--SS--HHHHHHHHHHHHHSSS--HHHHHHHHHHCTS-HHHHHHHHHHHHHHHH CS
          Homeobox   2 rkRttftkeqleeLeelFeknrypsaeereeLAkklgLterqVkvWFqNrRakek 56 
                       rk+ +++k+q  +Lee F+++++++ +++  LAk+lgL  rqV vWFqNrRa+ k
  Sopen08g026800.1 129 RKKLRLSKDQSAILEESFKEHNTLNPKQKLALAKRLGLRPRQVEVWFQNRRARTK 183
                       788899***********************************************98 PP

2HD-ZIP_I/II129.11.8e-41129218191
       HD-ZIP_I/II   1 ekkrrlskeqvklLEesFeeeekLeperKvelareLglqprqvavWFqnrRARtktkqlEkdyeaLkraydalkeenerLekeveeLreel 91 
                       +kk+rlsk+q+++LEesF+e+++L+p++K +la++Lgl+prqv+vWFqnrRARtk+kq+E+d+e+Lkr++++l+een+rL+kev+eLr +l
  Sopen08g026800.1 129 RKKLRLSKDQSAILEESFKEHNTLNPKQKLALAKRLGLRPRQVEVWFQNRRARTKLKQTEVDCEFLKRCCENLTEENRRLQKEVQELR-AL 218
                       69*************************************************************************************9.55 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
PfamPF046182.1E-322104IPR006712HD-ZIP protein, N-terminal
Gene3DG3DSA:1.10.10.606.9E-18116179IPR009057Homeodomain-like
SuperFamilySSF466892.01E-18121186IPR009057Homeodomain-like
PROSITE profilePS5007117.329125185IPR001356Homeobox domain
SMARTSM003893.0E-16127189IPR001356Homeobox domain
PfamPF000463.5E-16129183IPR001356Homeobox domain
CDDcd000867.10E-15129186No hitNo description
PRINTSPR000312.7E-5156165IPR000047Helix-turn-helix motif
PROSITE patternPS000270160183IPR017970Homeobox, conserved site
PRINTSPR000312.7E-5165181IPR000047Helix-turn-helix motif
PfamPF021832.1E-11185219IPR003106Leucine zipper, homeobox-associated
SMARTSM003404.2E-27185228IPR003106Leucine zipper, homeobox-associated
Gene Ontology ? help Back to Top
GO Term GO Category GO Description
GO:0008283Biological Processcell proliferation
GO:0009641Biological Processshade avoidance
GO:0009733Biological Processresponse to auxin
GO:0009735Biological Processresponse to cytokinin
GO:0009826Biological Processunidimensional cell growth
GO:0010016Biological Processshoot system morphogenesis
GO:0010017Biological Processred or far-red light signaling pathway
GO:0010218Biological Processresponse to far red light
GO:0045892Biological Processnegative regulation of transcription, DNA-templated
GO:0048364Biological Processroot development
GO:0005634Cellular Componentnucleus
GO:0003700Molecular Functiontranscription factor activity, sequence-specific DNA binding
GO:0042803Molecular Functionprotein homodimerization activity
GO:0043565Molecular Functionsequence-specific DNA binding
Sequence ? help Back to Top
Protein Sequence    Length: 279 aa     Download sequence    Send to blast
MMVEKEDLGL SLSLSFPDNN NNNKKNTQIN LSPFNLIQKT AWSDSLFPSS DRNIETCRVE  60
TRTFLKGIDV NRLPATGDAD EEAGVSSPNS TISSVSGNKR SEREANNCDQ EEHEMERGGS  120
DEEDGETSRK KLRLSKDQSA ILEESFKEHN TLNPKQKLAL AKRLGLRPRQ VEVWFQNRRA  180
RTKLKQTEVD CEFLKRCCEN LTEENRRLQK EVQELRALKL SPQFYMQMTP PTTLTMCPSC  240
ERVAGPPSSS SGPTSTPMGQ AQPRPMPFNL WANALHPRS
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
1127133SRKKLRL
2177185RRARTKLKQ
Regulation -- PlantRegMap ? help Back to Top
Source Upstream Regulator Target Gene
PlantRegMapRetrieve-
Annotation -- Nucleotide ? help Back to Top
Source Hit ID E-value Description
GenBankHG9754471e-175HG975447.1 Solanum pennellii chromosome ch08, complete genome.
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_015083748.10.0PREDICTED: homeobox-leucine zipper protein HAT4-like
SwissprotQ054661e-108HAT4_ARATH; Homeobox-leucine zipper protein HAT4
TrEMBLA0A0V0HV350.0A0A0V0HV35_SOLCH; Putative homeobox-leucine zipper protein HAT4-like
STRINGSolyc08g078300.2.10.0(Solanum lycopersicum)
Orthologous Group ? help Back to Top
LineageOrthologous Group IDTaxa NumberGene Number
AsteridsOGEA11182485
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT4G16780.11e-103homeobox protein 2